智能论文笔记

A Primer on Topological Data Analysis to Support Image Analysis Tasks in Environmental Science

Lander Ver Hoef , Henry Adams , Emily J. King , Imme Ebert-Uphoff

分类：机器学习 | 计算机视觉

2022-07-21

拓扑数据分析（TDA）是来自数据科学和数学的工具，它开始在环境科学领域引起波浪。在这项工作中，我们寻求对TDA工具的直观且可理解的介绍，该工具对于分析图像（即持续存在同源性）特别有用。我们简要讨论理论背景，但主要关注理解该工具的输出并讨论它可以收集的信息。为此，我们围绕着一个指导示例进行讨论，该指导示例是对RASP等人研究的糖，鱼类，花朵和砾石数据集进行分类。 al。 2020年（Arxiv：1906：01906）。我们证明了如何使用简单的机器学习算法来获得良好的结果，并详细探讨了如何用图像级特征来解释这种行为。持续同源性的核心优势之一是它的解释性是可解释的，因此在本文中，我们不仅讨论了我们发现的模式，而且要考虑到为什么我们对持续性同源性理论的了解，因此可以期待这些结果。我们的目标是，本文的读者将更好地了解TDA和持续的同源性，能够确定自己的问题和数据集，为此，持续的同源性可能会有所帮助，并从应用程序中获得对结果的理解包括GitHub示例代码。

translated by 谷歌翻译

Support vector machines and Radon's theorem

Henry Adams , Elin Farnell , Brittany Story

分类：机器学习

2020-11-01

支持向量机（SVM）是一种算法，该算法找到了超平面，最佳地将标记的数据点以$ \ mathbb {r} ^ n $分为正面和负类。该分离超平面裕度上的数据点称为支持向量。我们将支持向量的可能配置连接到Radon的定理，这提供了一组点可以分为两个类（正负）的保证，其凸壳相交。如果将正和负支持向量的凸壳投射到分离超平面上，则仅在超平面是最佳的，则投影在至少一个点中相交。此外，通过特定类型的一般位置，我们表明（a）支撑载体的投影凸船体在恰好一个点中相交，（b）支撑载体在扰动下稳定，（c）最多有$ n + 1 $支持向量，（d）每一个高达$ n + 1 $的支持向量是可能的。最后，我们执行研究预期的支持向量数及其配置的计算机模拟，用于随机生成的数据。我们观察到，随着该类型的随机生成的数据增加的距离增加，具有两个支持向量的配置成为最可能的配置。

translated by 谷歌翻译

Tracking Passengers and Baggage Items using Multiple Overhead Cameras at Security Checkpoints

Abubakar Siddique , Henry Medeiros

分类：计算机视觉

2022-12-31

We introduce a novel framework to track multiple objects in overhead camera videos for airport checkpoint security scenarios where targets correspond to passengers and their baggage items. We propose a Self-Supervised Learning (SSL) technique to provide the model information about instance segmentation uncertainty from overhead images. Our SSL approach improves object detection by employing a test-time data augmentation and a regression-based, rotation-invariant pseudo-label refinement technique. Our pseudo-label generation method provides multiple geometrically-transformed images as inputs to a Convolutional Neural Network (CNN), regresses the augmented detections generated by the network to reduce localization errors, and then clusters them using the mean-shift algorithm. The self-supervised detector model is used in a single-camera tracking algorithm to generate temporal identifiers for the targets. Our method also incorporates a multi-view trajectory association mechanism to maintain consistent temporal identifiers as passengers travel across camera views. An evaluation of detection, tracking, and association performances on videos obtained from multiple overhead cameras in a realistic airport checkpoint environment demonstrates the effectiveness of the proposed approach. Our results show that self-supervision improves object detection accuracy by up to $42\%$ without increasing the inference time of the model. Our multi-camera association method achieves up to $89\%$ multi-object tracking accuracy with an average computation time of less than $15$ ms.

translated by 谷歌翻译

Weakly-Supervised Semantic Segmentation of Ships Using Thermal Imagery

Rushil Joshi , Ethan Adams , Matthew Ziemann , Christopher A. Metzler

分类：计算机视觉

2022-12-26

The United States coastline spans 95,471 miles; a distance that cannot be effectively patrolled or secured by manual human effort alone. Unmanned Aerial Vehicles (UAVs) equipped with infrared cameras and deep-learning based algorithms represent a more efficient alternative for identifying and segmenting objects of interest - namely, ships. However, standard approaches to training these algorithms require large-scale datasets of densely labeled infrared maritime images. Such datasets are not publicly available and manually annotating every pixel in a large-scale dataset would have an extreme labor cost. In this work we demonstrate that, in the context of segmenting ships in infrared imagery, weakly-supervising an algorithm with sparsely labeled data can drastically reduce data labeling costs with minimal impact on system performance. We apply weakly-supervised learning to an unlabeled dataset of 7055 infrared images sourced from the Naval Air Warfare Center Aircraft Division (NAWCAD). We find that by sparsely labeling only 32 points per image, weakly-supervised segmentation models can still effectively detect and segment ships, with a Jaccard score of up to 0.756.

translated by 谷歌翻译

Semantically-consistent Landsat 8 image to Sentinel-2 image translation for alpine areas

M. Sokolov , J. L. Storie , C. J. Henry , C. D. Storie , J. Cameron , R. S. Ødegård , V. Zubinaite , S. Stikbakke

分类：计算机视觉 | 机器学习

2022-12-22

The availability of frequent and cost-free satellite images is in growing demand in the research world. Such satellite constellations as Landsat 8 and Sentinel-2 provide a massive amount of valuable data daily. However, the discrepancy in the sensors' characteristics of these satellites makes it senseless to use a segmentation model trained on either dataset and applied to another, which is why domain adaptation techniques have recently become an active research area in remote sensing. In this paper, an experiment of domain adaptation through style-transferring is conducted using the HRSemI2I model to narrow the sensor discrepancy between Landsat 8 and Sentinel-2. This paper's main contribution is analyzing the expediency of that approach by comparing the results of segmentation using domain-adapted images with those without adaptation. The HRSemI2I model, adjusted to work with 6-band imagery, shows significant intersection-over-union performance improvement for both mean and per class metrics. A second contribution is providing different schemes of generalization between two label schemes - NALCMS 2015 and CORINE. The first scheme is standardization through higher-level land cover classes, and the second is through harmonization validation in the field.

translated by 谷歌翻译

Spoken Language Understanding for Conversational AI: Recent Advances and Future Direction

Soyeon Caren Han , Siqu Long , Henry Weld , Josiah Poon

分类：自然语言处理 | 人工智能

2022-12-21

When a human communicates with a machine using natural language on the web and online, how can it understand the human's intention and semantic context of their talk? This is an important AI task as it enables the machine to construct a sensible answer or perform a useful action for the human. Meaning is represented at the sentence level, identification of which is known as intent detection, and at the word level, a labelling task called slot filling. This dual-level joint task requires innovative thinking about natural language and deep learning network design, and as a result, many approaches and models have been proposed and applied. This tutorial will discuss how the joint task is set up and introduce Spoken Language Understanding/Natural Language Understanding (SLU/NLU) with Deep Learning techniques. We will cover the datasets, experiments and metrics used in the field. We will describe how the machine uses the latest NLP and Deep Learning techniques to address the joint task, including recurrent and attention-based Transformer networks and pre-trained models (e.g. BERT). We will then look in detail at a network that allows the two levels of the task, intent classification and slot filling, to interact to boost performance explicitly. We will do a code demonstration of a Python notebook for this model and attendees will have an opportunity to watch coding demo tasks on this joint NLU to further their understanding.

translated by 谷歌翻译

Importance of Synthesizing High-quality Data for Text-to-SQL Parsing

Yiyun Zhao , Jiarong Jiang , Yiqun Hu , Wuwei Lan , Henry Zhu , Anuj Chauhan , Alexander Li , Lin Pan , Jun Wang , Chung-Wei Hang

分类：自然语言处理

2022-12-17

Recently, there has been increasing interest in synthesizing data to improve downstream text-to-SQL tasks. In this paper, we first examined the existing synthesized datasets and discovered that state-of-the-art text-to-SQL algorithms did not further improve on popular benchmarks when trained with augmented synthetic data. We observed two shortcomings: illogical synthetic SQL queries from independent column sampling and arbitrary table joins. To address these issues, we propose a novel synthesis framework that incorporates key relationships from schema, imposes strong typing, and conducts schema-distance-weighted column sampling. We also adopt an intermediate representation (IR) for the SQL-to-text task to further improve the quality of the generated natural language questions. When existing powerful semantic parsers are pre-finetuned on our high-quality synthesized data, our experiments show that these models have significant accuracy boosts on popular benchmarks, including new state-of-the-art performance on Spider.

translated by 谷歌翻译

Acela: Predictable Datacenter-level Maintenance Job Scheduling

Yi Ding , Aijia Gao , Thibaud Ryden , Kaushik Mitra , Sukumar Kalmanje , Yanai Golany , Michael Carbin , Henry Hoffmann

分类：机器学习

2022-12-10

Datacenter operators ensure fair and regular server maintenance by using automated processes to schedule maintenance jobs to complete within a strict time budget. Automating this scheduling problem is challenging because maintenance job duration varies based on both job type and hardware. While it is tempting to use prior machine learning techniques for predicting job duration, we find that the structure of the maintenance job scheduling problem creates a unique challenge. In particular, we show that prior machine learning methods that produce the lowest error predictions do not produce the best scheduling outcomes due to asymmetric costs. Specifically, underpredicting maintenance job duration has results in more servers being taken offline and longer server downtime than overpredicting maintenance job duration. The system cost of underprediction is much larger than that of overprediction. We present Acela, a machine learning system for predicting maintenance job duration, which uses quantile regression to bias duration predictions toward overprediction. We integrate Acela into a maintenance job scheduler and evaluate it on datasets from large-scale, production datacenters. Compared to machine learning based predictors from prior work, Acela reduces the number of servers that are taken offline by 1.87-4.28X, and reduces the server offline time by 1.40-2.80X.

translated by 谷歌翻译

Closed pattern mining of interval data and distributional data

Henry Soldano , Guillaume Santini , Stella Zevio

分类：人工智能 | 机器学习

2022-12-09

We discuss pattern languages for closed pattern mining and learning of interval data and distributional data. We first introduce pattern languages relying on pairs of intersection-based constraints or pairs of inclusion based constraints, or both, applied to intervals. We discuss the encoding of such interval patterns as itemsets thus allowing to use closed itemsets mining and formal concept analysis programs. We experiment these languages on clustering and supervised learning tasks. Then we show how to extend the approach to address distributional data.

translated by 谷歌翻译

Specifying Behavior Preference with Tiered Reward Functions

Zhiyuan Zhou , Henry Sowerby , Michael L. Littman

分类：机器学习 | 人工智能

2022-12-07

Reinforcement-learning agents seek to maximize a reward signal through environmental interactions. As humans, our contribution to the learning process is through designing the reward function. Like programmers, we have a behavior in mind and have to translate it into a formal specification, namely rewards. In this work, we consider the reward-design problem in tasks formulated as reaching desirable states and avoiding undesirable states. To start, we propose a strict partial ordering of the policy space. We prefer policies that reach the good states faster and with higher probability while avoiding the bad states longer. Next, we propose an environment-independent tiered reward structure and show it is guaranteed to induce policies that are Pareto-optimal according to our preference relation. Finally, we empirically evaluate tiered reward functions on several environments and show they induce desired behavior and lead to fast learning.

translated by 谷歌翻译